Information-theoretic measures of predictability for music content analysis
نویسنده
چکیده
This thesis is concerned with determining similarity in musical audio, for the purpose of applications in music content analysis. With the aim of determining similarity, we consider the problem of representing temporal structure in music. To represent temporal structure, we propose to compute information-theoretic measures of predictability in sequences. We apply our measures to track-wise representations obtained from musical audio; thereafter we consider the obtained measures predictors of musical similarity. We demonstrate that our approach benefits music content analysis tasks based on musical similarity. For the intermediate-specificity task of cover song identification, we compare contrasting discrete-valued and continuous-valued measures of pairwise predictability between sequences. In the discrete case, we devise a method for computing the normalised compression distance (NCD) which accounts for correlation between sequences. We observe that our measure improves average performance over NCD, for sequential compression algorithms. In the continuous case, we propose to compute information-based measures as statistics of the prediction error between sequences. Evaluated using 300 Jazz standards and using the Million Song Dataset, we observe that continuous-valued approaches outperform discrete-valued approaches. Further, we demonstrate that continuous-valued measures of predictability may be combined to improve performance with respect to baseline approaches. Using a filter-and-refine approach, we demonstrate state-of-the-art performance using the Million Song Dataset. For the low-specificity tasks of similarity rating prediction and song year prediction, we propose descriptors based on computing track-wise compression rates of quantised audio features, using multiple temporal resolutions and quantisation granularities. We evaluate our descriptors using a dataset of 15 500 track excerpts of Western popular music, for which we have 7 800 web-sourced pairwise similarity ratings. Combined with bag-of-features descriptors, we obtain performance gains of 31.1% and 10.9% for similarity rating prediction and song year prediction. For both tasks, analysis of selected descriptors reveals that representing features at multiple time scales benefits prediction accuracy.
منابع مشابه
Predicting Human Reactions to Music on the Basis of Similarity Structure and Information Theoretic Measures of the Sound Signal
Memory, repetition, and anticipatory structure are considered important characteristics of musical style. Both composers and listeners often refer to these parameters in describing the music. In this work we conducted audio analyses in an attempt to determine correlations between audio features and human responses of Familiarity and Emotional Force. The analyses were performed on two recordings...
متن کاملThe Melody Triangle: Exploring Pattern and Predictability in Music
The Melody Triangle is an interface for the discovery of melodic materials, where the input – positions within a triangle – directly map to information theoretic properties of the output. A model of human expectation and surprise in the perception of music, information dynamics, is used to ‘map out’ a musical generative system’s parameter space. This enables a user to explore the possibilities ...
متن کاملAnalaysis of IFLA Library Refrence Model’s Entities and Attrbutes For Iranian Traditional Music Resources (Case study: Morq-e sahar Song)
Background and Aim: The object of the study was to Analyze IFLA Library Reference Model (LRM) Entities and Attributes for Iranian Traditional Music Resources, Case Study: Morq-e Sahar Song. Method: The study inherits an applied content analysis method. All Entities and Attributes of IFlA LRM Model based on two checklists include: Final report of IFlA LRM on August 2017 and Transition Mappi...
متن کاملFeature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine
Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods. In filter methods, features subsets are selected due to some measu...
متن کاملشناسایی خودکار سبک موسیقی
Nowadays, automatic analysis of music signals has gained a considerable importance due to the growing amount of music data found on the Web. Music genre classification is one of the interesting research areas in music information retrieval systems. In this paper several techniques were implemented and evaluated for music genre classification including feature extraction, feature selection and m...
متن کامل